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Patent ^plication ns/07/621, 092C 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Kaufman, Randal J. 

Wasley, Louise 

(ii) TITLE OF INVENTION: Method of Increasing Yield of Mature 
Proteins 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc* 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/621,092 

(B) FILING DATE: 26-NOV-1990 

(C) CLASSIFICATION: 424 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)876-1170 

(B) TELEFAX: (617)876-5851 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
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(A) NAME/KEY: 


GDS 






















55 






(B) LOGATION: 


1. .2382 
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(X) PUBLIGATION INFORMATION: 


















58 






(A) AUTHORS: van den Ouweland, 


f • If • Vf • 
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(G) JOURNAL: Nucleic Acids Res. 
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(D) VOLUME: 18 
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(F) PAGES 


: 664- 
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(G) DATE: 


1990 
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(xi) SEQlfENGE DESGRIPTION: SEQ ID N0:1: 














66 


ATO 


GAG 


GTG 


AGG 


GGG 


TGG 


TTG 


GTA 


TGG 


GTG 


GTA 


GGA 


GGA 


AGA 


GGA 


AGG 


48 


67 


Met 


Glu 


Leu 


Arg 


Pro 


Trp 


Phe 


Leu 


Trp 


Val 


Val 


* i w 


Pro 


Thr 


Gly Thr 




V o 


1 








5 










10 










15 






69 




































70 


TT6 


GTG 


GTG 


GTA 


GGA 


GGT 


GAT 


GGT 


GAG 


GGG 


GAG 


AAG 


GTG 


TTG 


AGG 


AAG 


96 


71 


XiGU 


Val 


Leu 


Leu 


Ala 


Ala 


Asp 


Ala 


Gin 


Gly 


Gin 




Val 


Phe 


Thr 


Asn 




72 








20 










25 


















73 




































74 


AGO 


TGG 


GCT 


GTG 


GGG 


ATG 


GGT 


GGA 


GGG 


GGA 


GGG 


GTG 


GGG 


AAG 


AGT 


GTG 


144 


75 


Thr 


Trp 


Ala 


Val 


Arg 


He 


Pro 


Gly 


Gly 


Pro 


Ala 


Val 


Ala 


Asn 


Ser 


Val 




76 






35 










40 




















77 




































78 


6CA 


GGG 


AAG 


GAT 


GGG 


TTG 


GTG 


AAG 


GTG 


GGG 


GAG 


ATG 


TTG 


GGG 


GAG 


TAT 


192 


79 


Ala 


Arg 


Lys 


His 


Gly 


Phe 


Leu 


Asn 


Leu 


Gly 


Gin 


Tie 


Phe 


Glv 

VX J 


Asp Tyr 




80 




50 










55 










60 












81 




































82 


TAG 


GAG 


TTG 


TGG 


GAT 


GGA 


GGA 


GTG 


AGG 


AAG 


GGG 


TGG 


GTG 


TGG 


GGT 


GAG 


240 


83 




His 


Phe 


Trp 


His 


Arg 


Gly 


Val 


Thr 


Lys 


Arg 


Cor* 

OCX 


Leu 


Ser 


Pro 


His 




84 


65 










70 










75 










80 




85 




































86 


C6C 


GGG 


GGG 


GAG 


AGG 


GGG 


GTG 


GAG 


AGG 


GAG 


GGT 


GAA 


GTA 


GAG 


TGG 


GTG 


288 


87 


Arcr 


Pro 


Arg 


His 


Ser 


Arg 


Leu 


Gin 


Arg 


Glu 


Pro 


Gin 


Val 


Gin 


Trp Leu 




88 










85 










90 










95 






89 




































90 


GAA 


GAG 


GAG 


GTG 


GGA 


AAG 


GGA 


GGG 


AGT 


AAA 


GGG 


GAG 


GTG 


TAG 


GAG 


GAG 


336 


91 


Glu 


Gin 


Gin 


Val 


Ala 


Lys 


Arg 


Arg 


Thr 


Lys 


Arg 


ASD 


Val 


Tvr 


Gin 


Glu 




92 








100 










105 










110 








93 




































94 


GGG 


AGA 


GAG 


GGG 


AAG 


TTT 


GGT 


GAG 


GAG 


TGG 


TAG 


GTG 


TGT 


GGT 


GTG 


AGT 


384 


95 


Pro 


Thr 


Asp 


Pro 


Lys 


Phe 


Pro 


Gin 


Gin 


Trp 


Tyr 


Leu 


Ser 


Glv 


Val 


Thr 




96 






115 










120 










125 










97 




































98 


GAG 


GGG 


GAG 


GTG 


AAT 


GTG 


AAG 


GGG 


GGG 


TGG 


GGG 


GAG 


GGG 


TAG 


AGA 


GGG 


432 


99 


Gin 


Arg 


Asp 


Leu 


Asn 


Val 


Lys 


Ala 


Ala 


Trp 


Ala 


Gin 


Gly 


Tyr 


Thr Gly 




100 




130 










135 










140 












101 




































102 


GAG 


GGG 


ATT 


GTG 


GTG 


TGG 


ATT 


GTG 


GAG 


GAT 


GGG 


ATG 


GAG 


AAG 


AAG 


GAG 


480 


103 


His 


Gly 


He 


Leu 


Val 


Ser 


He 


Leu 


Asp 


Asp 


Gly 


He 


Glu 


Lys 


Asn 


His 




104 


145 










150 










155 










160 




105 




































106 


GGG 


GAG 


TTG 


GGA 


GGG 


AAT 


TAT 


GAT 


GCT 


GGG 


GGG 


AGT 


TTT 


GAT 


GTG 


AAT 


528 
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107 


Piro 


Asp Leu 


Ala Gly Asn 


xy t 


Asp 


Pro 


Olv 

vxy 


Ala 


Ser 


Phe 


Hxs 


Val 


Asn 












165 










1 70 
X / u 










175 

X / 9 








































110 


GAC 


wnv 


GAC 


CCT GAC 


CCC 


CAG 


CCT 


CGG 


TAC 


ACA 


CAG 


ATG 


AAT 


GAC 


AAC 


576 


111 

X X A 




Rl n 


A en 
Ao^ 


Pro Asp 


Pro 


Gin 


Pro 


Aifcr 
AZ-g 


xyr 


Thr 


Gin 


Me^ 


Asn 


Asp 


Asn 












180 








1 ft5 

X O 9 










1 9n 

X 7 U 








113 


































114 


AGG 






ACA CGG 


TGT 


GCG 


GGG 


GAA 


GTG 


OCT 


GCC 


GTG 


GCC 


AAC 


AAC 


624 


lis 

X X w 




His 


filv 


Thr Arg Cys 


AIa 
nxo 


Clv 

VX 


Clu 
vx u 


Va 1 
vox 


Al a 


Ala 


Va 1 

V ClX 


Ala 
AXa 


A en 

AoU 


A en 

AOU 




116 






195 

X 7 ^ 








200 










205 










1 17 

X X f 


































lift 

X xo 


COT 


OTP 
vxw 


XvX 


CCT CTA 
vvX vXA 


CCT 
vvX 


CTC 

VXw 


CPP 


TAP 
xnw 


AAP 


CPP 


PCP 


ATT 

AX X 


CCA 

WA 


CCC 


CTP 
vx^ 


672 

V / A 


119 

X X 7 




Val 
V ax 


X 


Clv VaI 
wXjf vax 


Clv 

wxy 


vox 


Ala 
AX a 


xyr 


A en 
AbU 


Al a 

AXCH 


Aver 

Arg 


Tl A 
X XB 


Clv 

«xy 


Clv 
vx jf 


Val 
vox 




120 




210 








215 










220 












121 

X A X 


































122 


CGC 


AXw 


PTC 


GAT GGC 


GAG 


GTG 


ACA 


GAT 


GCA 


GTG 


GAG 


GCA 


CGC 


TCG 


CTG 


720 


153 

X A w 


Arg 




T.A11 


Asp Gly 


Glu 


Vn 1 
VaX 


XUx 


Asp 


Ala 
AX a 


17a 1 
VaX 


cl 11 
vXU 


Ala 
AXa 


Arg 


Qav 

OCX 


Leu 




124 


225 








230 










2"? R 

^ J 9 










2An 




125 
XX w 


































126 


GGC 




a&p 

AAWr 


CCC AAC 


CAC 


ATC 


CAC 


ATC 


TAC 


AGT 


GCC 


AGC 


TGG 


GGC 


CCC 


768 


1 27 

Xa / 


Cl V 
ux y 


Leu 


A en 


Pro Asn 


His 


X xe 


TT-i e 
uXo 


T 1 A 

X xe 


Tyr 


d6x 


a 1 a 

Axa 


dex 


Trp 


cl V 

vxy 


Pro 




128 








245 










250 










255 

^ 9 w 






1 29 

X A 7 


































130 


GAG 


CAT 
vnx 


CAP 

wAW 


GGC AAG 


ACA 


GTG 


GAT 


GGG 


CCA 


GCC 


CGG 


CTC 


GCC 


GAG 


GAG 


816 


131 

XaJ X 


vx u 


Asp 


Asp 


Gly Lys 


Thr 


Va 1 

V CftX 


A en 
Aap 


CI V 

WX 


IT X O 


Al a 
AX a 


Arg 


Leu 


Ala 
AXa 


cl 11 
vXU 


cl 11 
vXU 




1 2 

X J A 








260 


















Z iU 








133 


































134 


GCC 


TTC 


TTC 


CGT GGG 


CTT 


AGC 


CAG 


GGC 


CGA 


GGG 


GCrG 


CTG 


GGC 


TCC 


ATC 

AX\^ 


864 

o w V 


135 


Ala 


Phe 


Phe 


Arg Gly 


Leu 


OCX 


Cl n 
wxu 


Cl V 
wxy 


Arg 


cl V 

VX Jf 


cl V 

vx 


Leu 


cl V 
vX 


Ser 


TIa 
X xc 




136 






275 








280 










285 

A o w 










137 


































138 


TTT 


GTC 


TGG 


GCC TCG 


GGG 


AAC 


GGG 


G€rG 


CGG 


GAA 


CAT 


GAC 


AGC 


TGC 


AAC 


912 


139 


Phe 


Val 


Trp 


Ala Ser Gly 


Aisn 


Clv 
\»xy 


Clv 


Arg 


Clu 
uxu 


xixa 


A en 

Ao^ 


Sat- 


PiF e 


A en 

AoU 




140 




290 








295 










300 












141 


































142 


TGC 


GAC 


GGC 


TAC ACC 


AAC 


AGT 


ATC 


TAC 


ACG 


CTG 


TCC 


ATC 


AGC 


AGC 


GCC 


960 


143 




Asp Gly 


Tyr Thr Asn 


Ser 


Tie 

X xo 


Tv** 
xyr 


xux 


T.A11 


Sat- 

OCX 


TIa 
X xo 


S A1^ 
OCX 


OCX 


Ala 
AX a 




144 


305 








310 










7 1 R 
w X9 










320 




145 


































146 


ACG 


CAG 


TTT 


GGC AAC 


GTG 


CCG 


TGG 


TAC 


AGC 


GAG 


GCC 


TGC 


TCG 


TCC 


ACA 


1008 

XwV w 


147 


xux 


Gin 


Phe 


Gly Asn Val 


It XU 


Trp 


Tyr 


Ser 


cl 11 
vXU 


Ala 
AXa 


Cys 


Qav> 
OCX 


Ser 


TH 

xnr 




148 








325 










330 

*J W w 










335 
w <j w 






149 


































150 


CTG 


GCC 


ACG 


ACC TAC 


AGC 


AGT 


GGC 


AAC 


CAG 


AAT 


GAG 


AAG 


CAG 


ATC 


GTG 


1056 


151 


Leu 


Ala 


Thr 


Thr Tyr 


Ser 


Ser 


Gly 


Asn 


Gin 


Asn 


Glu 


Lys 


Gin 


He 


Val 




152 








340 








345 










350 








153 


































154 


ACG 


ACT 


GAC 


TTG CGG 


CAG 


AAG 


TGC 


ACG 


GAG 


TCT 


CAC 


ACG 


GGC 


ACC 


TCA 


1104 


155 


Thr 


Thr 


Asp 


Leu Arg 


Gin 


Lys 


Cys 


Thr 


Glu 


Ser 


His 


Thr 


Gly 


Thr 


Ser 




156 






355 








360 










365 










157 


































158 


GCC 


TCT 


GCC 


CCC TTA 


GCA 


GCC 


GGC 


ATC 


ATT 


OCT 


CTC 


ACC 


CTG 


GAG 


GCC 


1152 


159 


Ala 


Ser 


Ala 


Pro Leu 


Ala 


Ala 


Gly 


He 


He 


Ala 


Leu 


Thr 


Leu 


Glu 


Ala 
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X w w 




370 










375 










380 












161 

X w X 




































162 


AAT 


AAG 


AAC 


CTC 


ACA 


TGG 


CGG 


GAC 


ATG 


CAA 


CAC 


CTG 


GTG 


GTA 


CAG 


ACC 


1200 




Asn 


Lys 


Asn 


Leu 


XUx 


Trp 


Arg 


Asp 


ne w 


til n 
vXU 


nxs 


Leu 


Ma 1 

vax 


Ma 1 
vax 


Gl n 
wXU 


Thy 
XUx 




1 6d 












390 

•J 7 V 










39S 

w 7 3 










dOO 




1 6S 




































1 66 
XDO 


XWV 


nun 


PPA 


*ipp 


PAP 


PTP 


AAT 
AAX 


<3PP 


AAP 
AAW 


OAP 


TGO 
Xuv 


GPP 


APP 


AAT 
AAX 


GGT 
wwX 


GTG 
wXw 


1 2dA 
x^ vo 


1 

Xo / 


s>ex 


Lys 


Pro 


IL 1 A 

nX& 


UxS 


Leu 


Asn 


B 1 a 

AX a 


Asn 


Asp 


Trp 


Ala 

AX a 


xnr 


Asn 


wxy 


\7a 1 

vax 




1 6fi 

XDO 










AOS 










din 

V Xtl 










d 1 s 

% X3 






1 6Q 

XD 7 




































170 


WW 


\ir\J\J 


AAA 


6TO 
wxu 


AOP 


CAP 


TP A 


TAT 
xnx 


OOP 


TAP 


ooo 

Www 


PTT 
WX X 


TTP 
X X w 


GAC 

wAW 


OCA 
wwn 


ooc 

www 


1296 

X A 7 w 


X / X 


vXy 


Arg 


Lys 


17a 1 

vax 


oer 


ni c 
UxS 


Cav- 

oer 


Tyr 


/11 

vxy 


Tyr 


d TT 

vxy 


Leu 


DVi A 

irne 


Asp 


Al a 

AX a 


d V 
wxy 




172 








A20 

V A W 










A2«; 

V ^ 3 










v3 w 








173 

X / w 




































1 7A 

X « V 




nx V 


UXv 


flpp 


PTO 
wXw 


OPP 


PAO 

^Av 


AAT 

AAX 


TOO 
XVw 


APP 


APA 

AWA 


OTP 
uxw 


OPP 
www 


PPP 
www 


PAO 
WAW 


POO 
www 


Xw V V 


1 7S 

X / 9 


& 1 A 


Jul A 4- 


V Gl X 


&1 A 

AX a 


Leu 


Al n 
AX& 


vXu 


Asn 


Trp 


T>1 

xnx 


Th V- 

xnx 


Vn 1 

vax 


Ala 
AX a 


IrxO 


d n 

wxn 


Arg 




176 

X / w 






V J 9 




















ddS 
V v3 










177 
X / / 




































1 7S 
X / o 




TCP 


ATP 
AXV« 


ATP 


CAP 
vA^ 


ATP 
AX\» 


PTP 
wX w 


APP 
A^w 


CAC 
uAv 


ppp 


AAA 
AAA 


GAP 


ATP 
AXV« 


GGG 
www 


AAA 
AAA 


PGG 
^ww 


1 ^92 
X3 7^ 


1 7Q 

X / 7 


Lys 


Cys 


Tl <* 
XX6 


T 1 ** 

xxe 


Asp 


T 1 A 

xxe 


ijeu 


xnr 


vXU 


pro 


Lys 


Asp 


T 1 

xxe 


Gxy 


Lys 


Arg 




1 AO 
xou 




dRO 










43 3 










d6n 












1 fil 

X O X 




































X O A 


PTP 
\«x\« 


HAP 


PTP 
v»x ^ 


PHA 


AAP 

AA^ 


APP 


OTO 
wxv 


APP 


OPO 


TPP 


PTO 

^X V 


GGP 


GAG 
vAv 


PPP 


AAP 
AA^ 


PAP 
WAw 


1 AA(\ 
X v«U 


xo J 


XiGU 


Asp 


Lgu 


Arg 


Asn 


xnr 




xnr 


Axa 


Ser 


Leu 


GXy 


(jXU 


Fro 


Asn 


nXS 




18d 
X o V 


d6^ 

V w w 










d7n 










d7 s 

« / 3 










d Afl 




185 




































186 


ATC 


ACT 


CGG 


CTO 


OAO 


CAC 


OCT 


CAO 


oco 

www 


coo 

v« w 


CTC 
wxw 


ACC 

AWW 


PTO 
wxu 


TPP 
xww 


TAT 

XAX 


AAT 

AAX 


1 dAA 
X voo 


1 ft7 
XO / 


T 1 

xxe 


Xux 


Arg 


Leu 


n1 «i 
vXU 


nxs 


& 1 a 

AX a 


vxn 


Ala 

Axa 


Arg 


Leu 


xnr 


Xieu 


Ser 


Tyr 


Asn 




1 fiA 
xoo 










dAS 

409 










4 7U 










473 






189 




































190 


CGC 


CGT 


GGC 


GAC 


CTG 


GCC 


ATC 


CAC 


CTO 

WX V 


OTP 


AOC 


CCC 
www 


ATO 
nx V 


OOP 
www 


APC 

MWW 


PGG 
www 


1S36 

X3 w 


191 

X 7 X 


Arg 


Arg 


ux y 


Asjp 


Leu 


a 1 n 
AXS 


T 1 A 

X xe 


UXS 


Leu 


\7a 1 

vax 


Ga v> 

oer 


Pro 


Ma^ 

ne b 


wxy 


xnr 


Arg 




192 








500 










SOS 

3 W3 










sin 

3 Xw 








193 




































194 


TCC 


ACC 


CTO 

\^X V 


CTO 

WX V 


CPA 


GPP 


AOO 

AW 


PPA 


PAT 


OAP 


TAP 

XAW 


TPP 

xw^ 


GPA 
VWA 


GAT 
WAX 


GGG 
www 


TxT 


1 SAd 

X30 V 


19^ 

X 7 9 


OCX 


XXlx 


Leu 


Leu 


A 1 n 

Axa 


Al A 

Axa 


Arg 


Pro 


nxs 


Asp 


Tyr 


Ca v> 

oer 


Ala 

Axa 


Asp 


wxy 


Irne 




196 






SIS 

^ X w 










3 










S9 S 
3^3 










197 




































198 


AAT 


GAC 


TGG 


GCC 


TTP 

XX\# 


ATO 

nx V 


ACA 


ACT 

A wX 


CAT 
wnx 


TCC 

X WW 


TOO 

Xww 


GAT 

wAX 


GAP 

UAW 


GAT 

WAX 


PPP 

www 


TPT 
X wx 


1 6'^2 

XDO A 


199 

X 7 7 


A cn 


A er\ 


Trp 


A 1 n 
AX eft 


PViA 


new 


xhx 


Tll*> 

xhx 


nxs 


Ser 


Trp 


Asp 


Asp 


Asp 


Pro 


s>er 




200 




530 










53S 

9 w 9 










SdO 

3 












201 




































202 


GG6 


GAG 


TGG 


GTC 


CTA 


GAG 


ATT 


GAA 


AAC 


ACC 


AOC 


GAA 


GCC 
www 


AAC 


AAC 


TAT 

XAX 


16AO 
X W ow 


203 


Glv 

uxy 


vxu 




Ma 1 
Vox 


XjcU 


vXU 


T 1 A 

X xe 


vXU 


Asn 


xnr 


Ser 


d 11 

vXU 


Ala 

AX a 


Asn 


Asn 


Tyr 




204 


545 










550 










555 










560 




205 




































206 


G6G 


ACG 


CTG 


ACC 


AAC 


TCC 


ACC 


CTC 


GTA 


CTC 


TAT 


GGC 


ACC 


GCC 


CCT 


GAC 


1728 


207 


Gly 


Thr 


Leu 


Thr 


Asn 


Ser 


Thr 


Leu 


Val 


Leu 


Tyr 


Gly 


Thr 


Ala 


Pro 


Asp 




208 










565 










570 










575 






209 




































210 


GGG 


CTC 


CCC 


GTA 


CCT 


CCA 


GAA 


AGC 


AGT 


GGC 


TGC 


AAG 


ACC 


CTC 


ACG 


TCC 


1776 


211 


Gly 


Leu 


Pro 


Val 


Pro 


Pro 


Glu 


Ser 


Ser 


Gly 


Cys 


Lys 


Thr 


Leu 


Thr 


Ser 




212 








580 










585 










590 
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313 




































A X v 


ACT 


CAO 


GCC 


TGT 

X wX 


GTG 

wX V 


GTG 

WX V 


TGC 


GAG 


GAA 


GGC 


TTC 


TCC 


CTC 


CAC 


CAG 


AAG 


1824 




0t3x 


Gl Tk 
VXU 


A1 A 


^ y B 


V ax 


Va 1 
wax 


Cw e 


G1 u 

wXU 


Gl u 

VX u 


Gl V 

VX 


PhA 

^UB 


SBX 




Wis 
n X 0 


Gin 

VXU 






216 

X X V 






595 

w 7 w 










600 










605 










217 




































21 A 


ARC 


TGT 

X vX 


GTC 

wX W 


GAG 

WAV 


GAC 


TGC 

XwW 


CCT 


CCA 


GGC 


TTC 
XX v« 


GCC 


CCC 


CAA 


GTC 


CTC 


GAT 


1872 


219 

X X 7 


oox 


Owe 


vox 


Gin 

VXU 


His 


uys 


F X U 


PX u 


Gl V 


Phe 

F Ut5 


AIa 
nxa 


P v*n 

f xu 


Gin 

VXU 


VaI 
V ax 


T.Alt 


A en 

Aoj/ 




220 

A X w 




610 

0 xu 










615 

W X W 










620 












221 

A A X 




































222 


ACQ 


GAG 


TAT 
xnx 


AGC 


ACC 


GAG 


AAT 
nnx 


GAC 


GTG 

VX V 


GAG 
vnv 


ACC 


ATC 
nxw 


CGG 


GCC 


AGC 


GTC 


1920 


XX W 


XUx 


Tti e 


Tyr 


Ser 


xhx 


nl 11 

wXU 


Asn 


Asp 


\7a 1 
Vox 


til 11 
vXU 


xhx 


T 1 A 
X X6 


Arg 


& 1 a 
AXa 


Ser 


Va 1 
VaX 




22A 
X X V 


625 
w X 9 










630 

V W 










635 

w J 3 










640 
0 V w 




225 

X X ^ 




































226 


T6C 


GCC 


CCC 


TGC 


CAC 


GCC 


TCA 


TGT 


GCC 


ACA 


TGC 


CAG 


GGG 


CCG 


GCC 


CTG 


1968 


227 

XX / 


s 


Ala 


Pro 


Cys 


His 


Ala 


Cav 

oex 


Cys 


Al n 
nXa 


Xux 


Cys 


Gl n 
vXU 


Gl V 

vxy 


Pro 


Ala 
AXa 


Leu 




22ft 

X X o 










645 










650 










655 






22Q 

X X 7 




































230 

X J 1/ 


APA 


GAC 


TGC 


CTC 


AGC 


TGC 




AGC 


cac 


GCC 
v^^ 


TCC 

x^^ 


main 
TTO 


GAC 
vn^ 


CCT 


GTG 
vXv 


GAG 

VAV 


201 6 

xi/ XD 


xO X 


Xux 


Asp 


Cys 


Leu 


Ser 


Cys 


IT ro 


oer 


nxS 


Ala 

Axa 


oer 


Leu 


Asp 


irro 


t7a 1 

vax 


vXU 




232 

X W X 








660 










0 D 3 










670 








233 

X J J 




































234 


CAO 


ACT 


TGC 


TCC 


CGG 


CAA 


AGC 


CAG 


AGC 


AGC 

AVW 


CGA 


GAG 

VAV 


TCC 


CCG 


CCA 

WWA 


CAG 

\«AV 


2064 
xu w 


23«; 




Thr Cys 


Ser Arg Gin 


OCX 


ni n 
vxn 


OBX 


Ser 


Arg 


ni 11 

vXU 


OBX 


Pro 


Pro 


vxn 




236 

X V 






675 










0 0 w 










6ft5 
0 0 w 










237 




































238 


CA6 


CAG 


CCA 


CCT 


CGG 


CTG 


CCC 


CCG 


GAG 


GTG 


GAG 


GCG 


GGG 


CAA 


CGG 


CTG 


2112 


239 

X 7 


Gin 


Gin 


Pro 


Pro 


Arg Leu 


IT XV 


PT*n 

It XW 


Gl u 

VXU 


Va 1 
V ax 


Gl u 

VXU 


A1 A 

Axa 


Gl V 

uxy 


Gl n 

VXU 


A 

Arg 


T.All 




240 




690 










695 

w 7 w 










700 












241 




































242 


C60 


GCA 


GGG 


CTG 


CTG 


CCC 


TCA 


CAC 


CTG 


CCT 


GAG 


GTG 


GTG 


GCC 


GGC 


CTC 


2160 


2d3 

X V W 


Arg 


Ala Gly 


Leu 


Leu 


Pro 


Ser 


TTi e 
XIxB 


Leu 


Pro 


vXU 


Va 1 
VaX 


Va 1 
VaX 


Ala 
AXa 


/11 V 

vxy 


Leu 




244 

X V V 


705 

/WW 










710 










715 
/ X9 










720 

/ X w 




245 




































246 


AGC 


TGC 


GCC 


TTC 


ATC 


GTG 


CTG 


GTC 


TTC 


GTC 


ACT 


GTC 


TTC 


CTG 


CTC 


CTG 


2208 


247 

X V f 


Ot3X 


Cys 


Ala 


Phe 


He 


Val 


Leu 


Va 1 
VaX 


PVia 


Vn 1 
VaX 


XUx 


Va 1 
VaX 


P>1A 


Leu 


Leu 


Leu 




248 










725 










7 30 










735 






249 




































250 


CA6 


CTG 


CGC 


TCT 


GGC 


TTT 


AGT 


TTT 


CGG 


GGG 


GTG 


AAG 


GTG 


TAC 


ACC 


ATG 


2256 


251 


Gin 

wXU 


Leu Arg 


Ser 


Gly Phe 


OCX 


It Ut? 


Arg 


Gl xr 
vxy 


Va 1 
VaX 


Ly s 


Va 1 
VaX 


Tyr 


XUx 






252 








740 










745 










750 








253 




































254 


GAC 


CGT 


GGC 


CTC 


ATC 


TCC 


TAC 


AAG 


GGG 


CTG 


CCC 


CCT 


GAA 


GCC 


TGG 


CAG 


2304 


255 


A en 


Arg Gly Leu 


He 


Ser 


Tyr 




ni V 
vxy 


Leu 


Pro 


Pro 


Gl 11 
VXU 


Ala 
AXa 


Trp 


Gl n 
vXu 




x3 D 






755 










/ dU 










*T £ R 










257 




































258 


6AO 


GAG 


TGC 


CCG 


TCT 


GAC 


TCA 


6AA 


GAG 


GAC 


GAG 


GGC 


CGG 


GGC 


GAG 


AGG 


2352 


259 


Glu 


Glu 


Cys 


Pro 


Ser Asp 


Ser 


Glu 


Glu 


Asp 


Glu 


Gly 


Arg 


Gly 


Glu 


Arg 




260 




770 










775 










780 












261 




































262 


ACC 


GCC 


TTT 


ATC 


AAA 


GAC 


CAG 


AGC 


GCC 


CTC 


TGA 












2385 


263 


Thr 


Ala 


Phe 


lie Lys Asp 


Gin 


Ser 


Ala 


Leu 
















264 


785 










790 
























265 
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266 

267 (2) INFORMATION FOR SEQ ID NO: 2: 
268 

269 (i) SEQUENCE CHARACTERISTICS: 

270 (A) LENGTH: 794 amino acids 

271 (B) TYPE: amino acid 

272 (D) TOPOLOGY: linear 
273 

274 (ii) MOLECULE TYPE: protein 

275 

276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

277 

278 Met Glu Leu Arg Pro Trp Phe Leu Trp Val Val Pro Pro Thr Gly Thr 

279 1 5 10 15 
280 

281 Leu Val Leu Leu Ala Ala Asp Ala Gin Gly Gin Lys Val Phe Thr Asn 

282 20 25 30 
283 

284 Thr Trp Ala Val Arg He Pro Gly Gly Pro Ala Val Ala Asn Ser Val 

285 35 40 45 
286 

287 Ala Arg Lys His Gly Phe Leu Asn Leu Gly Gin He Phe Gly Asp Tyr 

288 50 55 60 
289 

290 Tyr His Phe Trp His Arg Gly Val Thr Lys Arg Ser Leu Ser Pro His 

291 65 70 75 80 
292 

293 Arg Pro Arg His Ser Arg Leu Gin Arg Glu Pro Gin Val Gin Trp Leu 

294 85 90 95 
295 

296 Glu Gin Gin Val Ala Lys Arg Arg Thr Lys Arg Asp Val Tyr Gin Glu 

297 100 105 110 
298 

299 Pro Thr Asp Pro Lys Phe Pro Gin Gin Trp Tyr Leu Ser Gly Val Thr 

300 115 120 125 
301 

302 Gin Arg Asp Leu Asn Val Lys Ala Ala Trp Ala Gin Gly Tyr Thr Gly 

303 130 135 140 
304 

305 His Gly He Leu Val Ser He Leu Asp Asp Gly He Glu Lys Asn His 

306 145 150 155 160 
307 

308 Pro Asp Leu Ala Gly Asn Tyr Asp Pro Gly Ala Ser Phe His Val Asn 

309 165 170 175 
310 

311 Asp Gin Asp Pro Asp Pro Gin Pro Arg Tyr Thr Gin Met Asn Asp Asn 

312 180 185 190 
313 

314 Arg His Gly Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn 

315 195 200 205 
316 

317 Arg Val Cys Gly Val Gly Val Ala Tyr Asn Ala Arg He Gly Gly Val 

318 210 215 220 
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319 

320 Arg Met Leu Asp Gly Glu Val Thr Asp Ala Val Glu Ala Arg Ser Leu 

321 225 230 235 240 
322 

323 Oly Leu Asn Pro Asn His lie His lie Tyr Ser Ala Ser Trp Gly Pro 

324 245 250 255 
325 

326 Glu Asp Asp Gly Lys Thr Val Asp Gly Pro Ala Arg Leu Ala Glu Glu 

327 260 265 270 
328 

329 Ala Phe Phe Arg Gly Leu Ser Gin Gly Arg Gly Gly Leu Gly Ser lie 

330 275 280 285 
331 

332 Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Glu His Asp Ser Cys Asn 

333 290 295 300 
334 

335 Cys Asp Gly Tyr Thr Asn Ser lie Tyr Thr Leu Ser lie Ser Ser Ala 

336 305 310 315 320 
337 

338 Thr Gin Phe Gly Asn Val Pro Trp Tyr Ser Glu Ala Cys Ser Ser Thr 

339 325 330 335 
340 

341 Leu Ala Thr Thr Tyr Ser Ser Gly Asn Gin Asn Glu Lys Gin lie Val 

342 340 345 350 
343 

344 Thr Thr Asp Leu Arg Gin Lys Cys Thr Glu Ser His Thr Gly Thr Ser 

345 355 360 365 
346 

347 Ala Ser Ala Pro Leu Ala Ala Gly lie lie Ala Leu Thr Leu Glu Ala 

348 370 375 380 
349 

350 Asn Lys Asn Leu Thr Trp Arg Asp Met Gin His Leu Val Val Gin Thr 

351 385 390 395 400 
352 

353 Ser Lys Pro Ala His Leu Asn Ala Asn Asp Trp Ala Thr Asn Gly Val 

354 405 410 415 
355 

356 Gly Arg Lys Val Ser His Ser Tyr Gly Tyr Gly Leu Phe Asp Ala Gly 

357 420 425 430 
358 

359 Ala Met Val Ala Leu Ala Gin Asn Trp Thr Thr Val Ala Pro Gin Arg 

360 435 440 445 
361 

362 Lys Cys lie lie Asp lie Leu Thr Glu Pro Lys Asp lie Gly Lys Arg 

363 450 455 460 
364 

365 Leu Asp Leu Arg Asn Thr Val Thr Ala Ser Leu Gly Glu Pro Asn His 

366 465 470 475 480 
367 

368 lie Thr Arg Leu Glu His Ala Gin Ala Arg Leu Thr Leu Ser Tyr Asn 

369 485 490 495 
370 

371 Arg Arg Gly Asp Leu Ala lie His Leu Val Ser Pro Met Gly Thr Arg 
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372 500 505 510 

373 

374 Ser Thr Leu Leu Ala Ala Arg Pro His Asp Tyr Ser Ala Asp Gly Phe 

375 515 520 525 
376 

377 Asn Asp Trp Ala Phe Met Thr Thr His Ser Trp Asp Asp Asp Pro Ser 

378 530 535 540 
379 

380 Gly Glu Trp Val Leu Glu lie Glu Asn Thr Ser Glu Ala Asn Asn Tyr 

381 545 550 555 560 
382 

383 Gly Thr Leu Thr Asn Ser Thr Leu Val Leu Tyr Gly Thr Ala Pro Asp 

384 565 570 575 
385 

386 Gly Leu Pro Val Pro Pro Glu Ser Ser Gly Cys Lys Thr Leu Thr Ser 

387 580 585 590 
388 

389 Ser Gin Ala Cys Val Val Cys Glu Glu Gly Phe Ser Leu His Gin Lys 

390 595 600 605 
391 

392 Ser Cys Val Gin His Cys Pro Pro Gly Phe Ala Pro Gin Val Leu Asp 

393 610 615 620 
394 

395 Thr His Tyr Ser Thr Glu Asn Asp Val Glu Thr lie Arg Ala Ser Val 

396 625 630 635 640 
397 

398 Cys Ala Pro Cys His Ala Ser Cys Ala Thr Cys Gin Gly Pro Ala Leu 

399 645 650 655 
400 

401 Thr Asp Cys Leu Ser Cys Pro Ser His Ala Ser Leu Asp Pro Val Glu 

402 660 665 670 
403 

404 Gin Thr Cys Ser Arg Gin Ser Gin Ser Ser Arg Glu Ser Pro Pro Gin 

405 675 680 685 
406 

407 Gin Gin Pro Pro Arg Leu Pro Pro Glu Val Glu Ala Gly Gin Arg Leu 

408 690 695 700 
409 

410 Arg Ala Gly Leu Leu Pro Ser His Leu Pro Glu Val Val Ala Gly Leu 

411 705 710 715 720 
412 

413 Ser Cys Ala Phe lie Val Leu Val Phe Val Thr Val Phe Leu Leu Leu 

414 725 730 735 
415 

416 Gin Leu Arg Ser Gly Phe Ser Phe Arg Gly Val Lys Val Tyr Thr Met 

417 740 745 750 
418 

419 Asp Arg Gly Leu lie Ser Tyr Lys Gly Leu Pro Pro Glu Ala Trp Gin 

420 755 760 765 
421 

422 Glu Glu Cys Pro Ser Asp Ser Glu Glu Asp Glu Gly Arg Gly Glu Arg 

423 770 775 780 
424 



Page: 9 Raw Seqaence Listing 06/04/91 

12:12:27 

Patent l^^plication US/07/621, 092C 



425 Thr Ala Phe lie Lys Asp Gin Ser Ala Leu 

426 785 790 
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LINE ERROR 



ORIGINAL TEXT 



29 Wrong application Serial Number 

30 Wrong Filing Date 

31 Wrong Classification 




(A) APPLICATION NUMBER: US 07/621,092 

(B) FILING DATE: 26-NOV-1990 

(C) CLASSIFICATION: 424 
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